Robust Natural Language Generation from Large-Scale Knowledge Bases

نویسندگان

  • Charles B. Callaway
  • James C. Lester
چکیده

We have begun to see the emergence of large-scale knowledge bases that house tens of thousands of facts encoded in expressive representational languages. The richness of these representations o er the promise of signi cantly improving the quality of natural language generation, but their representational complexity, scale, and task-independence pose great challenges to generators. We have designed, implemented, and empirically evaluated Fare, a functional realization system that exploits message speci cations drawn from large-scale knowledge bases to create functional descriptions, which are expressions that encode both functional information (case assignment) and structural information (phrasal constituent embeddings). Given a message speci cation, Fare exploits lexical and grammatical annotations on knowledge base objects to construct functional descriptions, which are then converted to text by a surface generator. Two empirical studies|one with an explanation generator and one with a qualitative model builder|suggest that Fare is robust, e cient, expressive, and appropriate for a broad range of applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Large - Scale Knowledge Bases 1

In recent years, the natural language generation community has begun to mature rapidly and produce sophisticated oo-the-shelf surface realizers. A parallel development in the knowledge representation community has been the emergence of large-scale knowledge bases that house tens of thousands of facts encoded in expressive representational languages. Because of the richness of their representati...

متن کامل

Robust Natural Language Generationfrom Large - Scale Knowledge Bases 1

We have begun to see the emergence of large-scale knowledge bases that house tens of thousands of facts encoded in expressive representational languages. The richness of these representations ooer the promise of signiicantly improving the quality of natural language generation, but their repre-sentational complexity, scale, and task-independence pose great challenges to generators. We have desi...

متن کامل

Two-Level, Many-Path Generation

Large-scale natural language generation requires the integration of vast mounts of knowledge: lexical, grammatical, and conceptual. A robust generator must be able to operate well even when pieces of knowledge axe missing. It must also be robust against incomplete or inaccurate inputs. To attack these problems, we have built a hybrid generator, in which gaps in symbolic knowledge are filled by ...

متن کامل

Automatic extraction of facts, relations, and entities for web-scale knowledge base population

Equipping machines with knowledge, through the construction of machinereadable knowledge bases, presents a key asset for semantic search, machine translation, question answering, and other formidable challenges in artificial intelligence. However, human knowledge predominantly resides in books and other natural language text forms. This means that knowledge bases must be extracted and synthesiz...

متن کامل

A Repository of Frame Instance Lexicalizations for Generation

Robust, statistical Natural Language Generation from Web knowledge bases is hindered by the lack of text-aligned resources. We aim to fill this gap by presenting a method for extracting knowledge from natural language text, and encode it in a format based on frame semantics and ready to be distributed in the Linked Open Data space. We run an implementation of such methodology on a collection of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995